Picture for Hao Peng

Hao Peng

Beihang University

Learning to Explore: Policy-Guided Outlier Synthesis for Graph Out-of-Distribution Detection

Add code
Feb 28, 2026
Viaarxiv icon

Heterophily-Agnostic Hypergraph Neural Networks with Riemannian Local Exchanger

Add code
Feb 28, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

Kelix Technical Report

Add code
Feb 12, 2026
Viaarxiv icon

Dialogue Model Optimization via Agent Game and Adaptive Tree-based GRPO

Add code
Feb 09, 2026
Viaarxiv icon

WildReward: Learning Reward Models from In-the-Wild Human Interactions

Add code
Feb 09, 2026
Viaarxiv icon

Low-Light Video Enhancement with An Effective Spatial-Temporal Decomposition Paradigm

Add code
Feb 09, 2026
Viaarxiv icon

Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs

Add code
Feb 07, 2026
Viaarxiv icon

ReBeCA: Unveiling Interpretable Behavior Hierarchy behind the Iterative Self-Reflection of Language Models with Causal Analysis

Add code
Feb 06, 2026
Viaarxiv icon

Faithful Bi-Directional Model Steering via Distribution Matching and Distributed Interchange Interventions

Add code
Feb 05, 2026
Viaarxiv icon